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The present invention relates to sequences of 
nucleic acids of mycobacteria belonging to the M. 
tuberculosis complex. 
10 The invention likewise relates to a method of 

in vitro detection of Strains Of mycobacteria belonging 
to the M. tuberculosis complex as well to a method 
for a differential diagnosis or Strains of the M. 
tubus culosis complex, especially for differentiating 
lb the presence of the BCG from that of other members of 
the complex in a sample. 

Approximately, 1.7 billion people cr 1/3 of the 
world population are infected with M. tuberculosis 
fSudre et al., 1992). In 1990, ths eslii,u»ted number of 
20 cases of tuberculosis was 8 million, Including 2.3 
million deaths (Sudre et al., 1992). These last few 
years, the number of cases of tuberculosis in the 
United Stales and in Europe hoc increased by 3 to fi% 
per annum, principally in populations at high risk such 
25 as patients suffering ftoia aids, chronic alcoholics, 
the homeless and drug addicts (Barnes et al., 1991). 

Taking account of the difficulties i;j fighting 
infections by mycobacteria, there is an urgent need to 
be able to have a specific and sensitive rapid method 
30 allowing these infections to be diagnosed. Also, the 
early detection of M. tuberculosis in clinical samples 
is taking on growing importance in the control of 
tuberculosis both for the clinical treatment of 
infected patients and for the identification of exposed 
35 individuals at risk. 

Detection oy fck {Polymerase Cham Reaction) of 
specific CNR of species of mycobacteria is probably one 
. or che most promising novel approaches for rapid 
specitrc and sensitive diagnosis (saiki et al., 196 5 ' 
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Brisson-Noel et al., 1959; i Cousins et al . , 1312; 
Eisenach ct al., 1990; Forbes et al. 1993; Fries et 
al., 1991; Hermans et al., 1990; Jonas et al., 1993; 
Kolk et al., 1992; Pierre et al., 1991; Saboor et al., 
1992; Shankar et al., 1991; Sjobrmg et al., 1990). The 
different studies carried out up to date, however, have 
led to different results as far as the specificity and 
the sensitivity aie concerned. Among the reasons for 
this diversity, It is especially possible to note 
methodological differences concerning the preparation 
of the samples, the protocol followed for the 
amplification or the methods of detection of the PCR 
products . 

Several of these studies relate to the 
detection by PCR of the M. tnkarcnlosis complex 
starting from the target DNA IS6110 (Clarridge et al., 
1993; Eisenach et al., 13S0 ; Forbes et al., 3 993; 
Nocrdhoeck et al., 1994), the gene coding for the 55 
kDa antigen (Brisson-Noel et al., 1991; Telenti et al., 
1993), the gene coning for Che 38 kDa antigen 
(Folgueira ct al., 1993; Sjobring et al . , 1390) or the 
liposomal 1SS RNA (Kox et al., 1995). However, all 
these diagnostic tests identify the M. tuberculosis 
complex in its entirety. 

The M. tuberculosis complex comprises 
M, tuberculosis, M. bovis, Af. microti and M. africanum. 
These four species have strong homologies in their DNA 
(35 to 100*1 (Imeada, 1985) and have several totally 
identical genes. This homology has limited the use of 
DNA sequences to differentiate the strains. It would 
nevertheless be of particular interest to be able to 
differentiate the strains of M. tuberculosis from those 
of the Calmette-Guerin (BCG) bacillus u. bovis, the 
latter often being used as live vaccines for 
xmunoprotecUw, ayainst tuberculosis . obviously, thcrc 
« thus an interest in distinguishing between poocibly 
P«lhoyeuic mycobacteria and vaccinating DCG strains. 
This disLiucLioa is particularly important in the case 
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of iiwrminodeficient individuals, such as subjects 
infected by HIV. 

Restriction fragment: length polymorphism |R?LP) 
analyses based on the insertion of Ihe 136110 elements 
have been used to differentiate different strains of 
M. tubeiculosis. It has been shown that this insertion 
element allowed specific RFLP profiles of strains to be 
obtained on account of a variability in ins 
localization and i:i the number of copies existing in 
the genomes of different strains. IS6110 has been 
demonstrated in M. tuberculosis and in M. bovis but not 
in -he other mycobacteria tested (Cave et al., 1921). 
In general, several copies of this insertion element 
can be demonstrated in M. tuberculosis, although a 
single copy is found in M. bnvis BCG (Cave et al., 
±991 ) . However, certain strains of M. tuberculosis are 
devoid of 155110 (van Soolingen et al . , 1994) and 
strains of M. bovis having a significant number of 
copies are common in some populations (van Soolingen et 
al., 1994). Added to these limitations is the fact that 
the identification of strains of M. tuberculosis from 
ISS110 necessitates Southern Blot analyses and cannot 
easily be carried out by routine PCR methods. 

The authors of the present invention have 
demonstrated a novel target DNA sequence for enzymatic 
amplification. This sequence is part of an operon 
coding for a specific two-component regulatory system 
of M. leoraa and bacteria which are -members of the 
M. tuberculosis complex. The two-component systems are 
regulatory .systems belonging to a large family, 
involving two proteins which cooperate in translating 
external signals by modifications in the level of 
genetic expression (Parkinson and Knfoid, 1992). 
According to a proposed general model, one component 
localized in the raerabrane could act as a sensor of 
environmental changes and could transmit the 
information to a regulatory component of the response 
xn the cytoplasm, which in turn could modulate the 
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transcription of tarqet genes. The communication 
between the two components is generally carried out by 
a cascade of phosphorylations. 

The authors of the present invention have at 
> present cloned and characterized a two -component 
mycobacterial system. This system appears to be 
specific to members of the M. tuberculosis complex and 
to M. leprae. Unexpectedly with respect to the other 
two-component systems, the gene3 of this mycobacterial 
system are separated by DNA sequences (intercistroni c 
" sequences or repeated seauences) which are uniquely 
present among the strains of mycobacteria of the 
M. tuberculosis complex and in M. leprae. 

The authors of the present invention have 
additionally shown that in the strains of the 
M. tuberculosis complex, this interc* stronic region 
corresponded Co an exact or truncated number cf 
repetitions of a sequence of 77 base pairs, SEQ ID 
No. 1, which was able to vary among the strains. The 
truncated sequence (SEQ ID No. 2) is composed cf S3 
base pairs and contains a short in-phase internal 
deletion of nucleotides Nos, 40 to 66, which die 
substituted by a GAG codon. 

The authors of the invention have likewise 
shown that, surprisingly, the truncated sequence (SEQ 
ID No. 2} was characteristic of the members of the 
M. tuberculosis comple* which are different from BCG. 

In M. leprae, the corresponding inrercistronic 
region is formed by a variant of 52 base pairs already 
additionally described and whose sequence is indicated 
belcw: 

5' atg aca ccc gcg cag gcg atg atg cag age gaa gtg acq 
age ggg aat gtg a 3 ' . 

Taking account of their characteristics, these 
repeated sequences. are of particular interest for 
identifying and differentiating between strains of the 
M. tuberculosis complex by enzymatic amplification 
techniques, such as PCR or other similar methods. More 
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particularly, they allow a differencial diagnosis 
between the presence uf BCG* and that of other members 
of the complex In a biological sample tc be 
established- This diffeiential diagnosis, whose 
principle is based on the specific detection of the 
sequence SEQ ID No. 2, forms a preferred aspect of the 
invention. It is advantageously employed lo distinguish 
an infection by BCG from an infection by other virulent 
members of the complex in imaunortef f icient individuals, 
such as especially subjects infected by HIV. 

The invention thus relates to a specific 
fragment of nucleic acids of mycobacteria belonging to 
the M. tuberculosis complex, comprising a sequence of 
nucleotides chosen from amongst the sequence SEQ ID 
No. 1, the sequence PEQ ID No. I, their complementary 
sequences or the sequencer of nucleic acids capable of 
hybridizing with one of the preceding sequences under 
conditions of high stringency. 

II is likewise aimed at the use of the said 
sequences for the production of nucleotide probes for 
in vitro detection of mycobacteria belonging tc the 
M. tfjberculosis complex and ol oligonucleotide primers 
for the enzymatic amplification of specific sequences 
of strains which are members of the M. tuberculosis 
complex . 

The invention likewise relates to a method 
allowing the strains of the M. tuberculosis complex to 
be differentiated from one another, particularly f or 
allowing BCG to be differentiated from pathogenic 
mycobacteria of this complex. 

It likewise provides means for differentiating 
sub-groups among the strains forming the 
M. tuberculosis complex. 

In addition, it can allow m. leprae o£ Lhe 
species and of the strains beionqinq to the 
M. tuberculosis complex to be differentiated in a 
biological sample. 



WO 98/07847 PCT/FRS7/0L483 
- 6 - 

in the concent of the present invent icn, it is 
considered that the conditions of "h-igh stringency" in 
which two nucleotide sequences can hybridize are the 
conditions defined by Saiubrook et al., 1989, namely 
i temperature conditions of between {?„, minus 5°C) and 
{T w minus 30°C) and additionally preferably temperature 
conditions of between ,(T m minus 5°C) and {T m minus 10°C) 
(high stringency), T ra being the melting temperature, 
defined as being the temperature at which 503 of the 
paired strands separate. Preferentially, the conditions 
of high stringency used correspond to prehybridization, 
hybridization and washing temperatures of 6R"C and a 
prehybridization and hybridization buffer based on 5 x 
SSC (protocol recommended by Boehringer Mannheim) . 

The invention relates to d specific fragment of 
nucleic acids of mycobacteria of the M. tuberculosis 
complex localized in the intercistronic zeyion of the 
t>nnX3- regX3 system. 

As will be explained in the examples, the 
sequence SEQ ID No . 1 corresponds to an entity of 7 7 
repeated base pairs and a different number of copies 
according to the sliains of mycobacteria studied. This 
repeated seguence has an open reading frame (ORF] which 
can code for a peptide of 25 amino acids. 

ThP invention is thus aimed at a fragment of 
nucleic acids specific to the M. tuberculosis complex, 
comprising a sequence of nucleotides selected from the 
sequence SEQ ID No. 1, its complementary seguence or 
the sequences of nucleic acids capable of hybridizing 
with one of the preceding sequences under conditions ot 
high stringency. 

According to another aspect, the invention is 
aimed at a specific fragment of nucleic acids of 
members of the M. tuberculosis complex which arc 
different from bcg, especially the violent species 
M. tuberculosis, M. africanum or M. bovi 5 , comprising a 
seguence of nucleotides chosen f IUJll ^ 
sequence SEQ ID No. 2, its complementary sequence Or 
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the sequences of nucleic acids capable of hybridising 
with one of the preceding sequences under conditions of 
high stringency. 

as is actually illustrated by the examples, she 
BCG srrains can be differentiated from all the other 
strains of the M. tuberculosis complex because of the 
absence of the sequence SEQ ID No. 1 in the senX3-ragX3 
intergenic region in BCG (See Table 3, groups 4, 6, R) . 

The different nucleotide sequences of the 
Invention can be of artificial origin ox non- artificial 
origin. They can be DMA nr RNA seauences. 

They can be prepared, for example, by chemical 
synthesis , or alternatively by mixed methods including 
chemical or enzymatic modifications of sequences 
obtained by screening sequence libraries, by means of 
probes elaborated on Lhe basis cf the sequences SEQ ID 
No. i or 2. Such libraries can be prepared by the 
classics 1 techniques Of molecular biology, known to the 
person skilled in the art. 

The nucleotide sequences of the invention allow 
the production of probes or of nucleotide primers 
capable of hybridising specifically with chese, their 
corresponding RNA sequences or the corresponding genes 
under conditions of high stringency. Snob probes are 
likewise part of the invention. They can be used as an 
in vitro diagnostic tool for the detection, by 
hybridization experiments, of specific nucleic acid 
sequences of mycobacUiia belonging to the 
M. tuberculosis complex. 

Preferentially, the probes of the invention 
have at least 24 consecutive nucleotides although 
shorter probes can be equally suitable. At the maximum, 
these have the complete S enYl-r«gX3 intergenic region 
of M. tubcx-culoeis (IPL), namely two successive 
sequences SSQ ID No. 1 fcllowsd by a s.qu.nce SEO TD 
No. 2. This DNA fragment contains 218 base pairs. 

Among the specific preferred nucleotide probp* 
of the members of the M. tvbcJtcvlosis complex, the 



WO 98/07847 PCT/FR97/01483 

- 8 - 

sequence StlQ ID No, 1 in its entirety or its 
complementary sequence appear especially. 

According to another aspect, the invention is 
aimed at nucleotide probes for the detection and the 
demonstration of sequence of nucleic acids specif: r. to 
members of the M. tuberculosis complex which are. 
different from BCG, especially Che virulent opecies 
M. tubaxculosis, M. africanum or tt. bovis, as well as 
for the differential diagnosis of the presence of BCG 
in a biological sample containing mycobacteria of the 
M. tuberculosis complex. 

Such nucleotide probes are obtained from a 
region of the sequence SEQ ID No. 2 surroundirg the GAG 
codon ir position 40-42 or of its complementary strand. 
Preferentially, this region has a length of 21 base 
pairs, and contains 9 nucleotides upstream and 
downstream of the specific codon of the sequence 

SEQ ID No. 2. 

Advantageously, these probes comprise the 
sequence SEQ ID No. 2 in its entirety or its 
complementary sequence. 

The probes of the invention hybridize ?rr.ording 
to the appropriate hybridization conditions, 
cot-responding to the conditions of temperature and of 
ionic strength usually used by the person skilled in 
the art. 

Preferentially, the probes of the invention are 
labelled, previously ro their use. i'or this, several 
techniques are at the disposal of the person skilled in 
the art {fluorescent, radioactive, chemiluininiscent, 
enzymatic, etc. labelling). 

According to a preferred embodiment, the probes 
are labelled with digoxygenin. Digoxygenir. (DIG) is a 
steroid hapten coupled to dUTP" for labelling DNA used 
as a probe. This technology is marketed by Boehringer 
Mannheim. After the hybridization with the probe and 
washing, zhe detection is carried out following an 
emission of a chemoluminescent signal by means of the 
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SUbsUdt'e disodium 3- (4-metnoxyspiro{ 1 , 2 dioxcthane- 
3 , 2 ' - { 5 ' -chloro } tr icyclodecsm } - 4-yl ) pheny Ipho spha te 
ICSFD) . 

The nucleotide sequences of the invention are 
likewise useful for the production and the use cl 
oligonucleotide primers for sequencing reactions or for 
enzymatic amplizication. 

The enzymatic amplification techniques are 
principally illustrated by FCK. Other similar method.? 
can, however, be used such as, for example, LCR (Ligase 
Chain Reaction) , NASBA (Nucleic Acid Sequence Based 
Amplification), Q-Preplir.ase, SDA (Strand Displacement 
Amplification) and any other variant comprised in the 
technological knowledge of the person skilled in the 
art. These nucleic acid amplification techniques use 
oligonucleotide piimer molecules to initiate the 
elongation reaction ol the target sequence. The exact 
length of these primers will be able to vary according 
to the case. For example, as a function of the 
complexity of the sequence of the matrix, a 
polynucleotide primer typically contains Cjloiu 15 to 25 
nucleotides or more. In certain cases, however, it can 
contain less. 

Preferentially, the nucleotide primers of the 
invention comprise ar least 19 nucleotides. They are 
formed by piimers chosen from sequences adjacent to the 
senAJ-regXJ inleigenic region, in the regions 3« of 
senX'3 and 5' of zegX3. 

According to a preferred variant, the pair of 
primers referred to as C5 and C3 are used, namely 
5 ' GCGCGAGflGCCCGflACTGC3 ' and 5 • GCGCAGCAGAAACGTCAGC3 • 
corresponding respectively to the 3' end of the atinX3 
gene and the 5' end of the reoStt gene. These primers 
re3 F Gctively hybridize 56 base pairs upstream of the 
interciatronic region and 62 base pair S downstream of 
the latter. 

The nucleotide sequences *nd the probe* 
resulting therefrom can be cloned in cloning and/or^ 
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expression vectors according to classical techniques of 
molecular biology, especially involving the use of 
restriction enzymes and of specific cleavage sites. 

A preferred cloning vectcr in the present 
invention is represented by the plasmid pRegXMtl 
deposited at the CNCH under the number 1-1766 whose 
construction is described in greater detail in the 
examples below. 

Another cloning vector according to the 
invention is represented by the plasmid pRcgX3Bcl 
deposited at the CNCM under the numbex 1-1765. These 
plasiuids ha ve each been introduced into the bacterium 
£. coll XLl-blue. 

The vectors 1-1765 and I-1766 each contain zhe 
complete senX3-iegX3 genes with the i ntercistronic 
regions of BCG and of M. tuberculosis respectively. 

The nucleotide sequences according to the 
invention can additionally be expressed in appropriate 
systems, for the production and the study of the 
biological activities ot the corresponding peptides. In 
this case, these will be placed under the control of 
signals allowing their expression in a cell host. 

Ar. efficient sysrem of production ot a protein 
or of a recombinant peptide necessitates having a 
vectot, for example of plasmid or viral origin, and a 
compatible host cell. 

The cell host can be chosen from prokaryotic 
systems, ~ like bacteria, or eukaryotic systems, for 
example lijce veasts, insect cells, CHO {Chinese hamster 
ovary) cells or any other system advantageously 
available. 

The vector must contain a promoter, translation 
initiation and termination signals, as well as the 
appropriate regions of transcription regulation, it 
ttUOt bs 3ble to bo- maintain^ .,r*h1y in the celi and 
can possibly have particular signals specifying the 
secretion of the translated protein. 
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These different control signals are chosen as a 
function of the cell host used. To this end, the 
nucleotide sequences according to the invention can be 
inserted into autonomic replication vectors in the 
chosen host, or integrative vectors of the chesen host. 
Such vectors will be prepared according to the methods 
currently used fcv the person skilled in the art, and 
the clone? resulting thereftoni can be introduced intc 
an appropriate host by standard methods, such as, for 
example, elactropo ration. 

The in vitro diagnostic of detection methods in 
which the nucleotide probes obtained from sequences of 
the invention arc employed are likewise part of the 
present invent ion. 

More particularly, the invention is aimed at a 
method of detection of strains of mycobacteria 
belonging to the M. tuberculosis complex in a 
bioingical sample comprising the following steps: 

(i) contacting the biological sample- with a 
pair of primers under conditions allowing hybridization 
of the said primers to the nucleic acids specific to 
strains of mycobacteria belonging iu the 
M. tuberculosis complex; 

(ii) amplification of the said nucleic acids; 

(iii) contacting a nucleotide probe according 
to the invention with the said biological sample under 
conditions allowing the formation of hybridization 
complexes between the said probe and' the amplified 
nucleic acid sequences, - 

(iv) detection of the hybridization complexes 

formed. 

Recording to a first variant, the method 
according to the invention allows the presence of any 
member of the M. tuberculosis complex to be detected In 
a biological sample, m this case, rhe complexes of 
step (iii) defined above are formed with a specific 
nucleotide probe of the sequence SEQ ID No. 1 or of its 
complementary strand. 
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According to a second advantageous variant, the 
method of the invention allows the presence cl members 
of the M. Laberculoaic complex other than BCG Lo be 
specifically detected. According to this variant, the 
complexes of step (iii) which ar« detected are formed 
wir.h a nucleotide probe specific to the sequence SEQ ID 
Mo. ? or to its complementary strand, consisting 
preferentially of a shoit sequence composed of two 
times 9 base pairs framing the GAG codon at the 
specific positions 40 to 42 of the sequence SEQ ID 
No. 2. 

According to a third particularly advantageous 
variant, the method of the invention allows 
differential diagnosis of the BCG and of other members 
of the complex. In this case, the method consists at 
first of demonstrating nucleic acids specific to ail 
the members of the M. tuberculosis complex by detection 
according ZO step iiv) of the hybridization complexes 
formed with a first specific nucleotide probe o* the 
sequence SEQ 1U No. 1 oi of its complementary strand, 
then in finding among che amplified nucleic acids 
capable of forming complexes with the said first probe 
those which can likewise rorm complexes with a second 
specific nucleotide probe of SEQ ID He. 2 or of its 
complementary strand. 

The amplified nucleic acid sequences 
hybridizing uniquely with the first prcbe correspond to 
specific nucleic acid sequences of the BCG, aithougn 
the sequences hybridising with each of the two probes 
correspond to sequences of the other members of the 
M. tuberculosis complex. 

This method of differential diagnosis is of 
obvious interest with respect to the conventional 
methods of detection, because it allows an infection by 
BCG tpessibly from a vaccination) to be differen Lifted 
from that by a virulent mycobacteria. of the 
M. tuberculosis complex la. LuberculosLs, M. bovis or 
M. zfricanum, and possibly «. microti). This 
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distinction is particularly important in 
immunodeficient individual's, especially subjects 
inf°r.ted bv HIV. 

According lo another preferred embodiment, the 
invent inn provides a method of identification of groupa 
of mycobacteria belonging to the M. tuberculosis 
complex, characterized in that: 

- the DNA Of the said strains previously 
extracted with a pair of primers such aa defined above 
is contacted under conditions allowing a specific 
hybridization of the said primers with their 
corresponding sequences on the DNA of the said strains 
and the obtainment of amplification products, and 

- the length of the amplification products 
obtained is measured, for example by agarose gel 
electrophoresis . 

Advantageously, the primers 5 1 GCGCGAGAGCCCG- 
AA.CTGC3 ' End 5 ' GCGCAGCAGAAACGTCACC3 ' ■ are used in this 
method . 

The invention likewise relates to a kit for the 
in vitro identification of slrains of mycobacteria 
belonging to the M. tuberculosis complex in a 
biological sample comprising: 

- a pair of primers according to the invention, 
as defined above; 

the reagents necessary to eliow the 
amplification of the specific sequences , of nucleic 
acids belonying to the M. tuberculosis complex with the 
aid of the said primers, 

- possibly means for revealing the amplified 
fragments, preferentially a nucleotide probe of the 
invention. 

Other characteristics and advantages of the 
invention are illustrated bv the examples following the 
description as well. as by the tigures whose legends dI6 
indicated below. 
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Figure 1: "Southern Blot" analysis of the DMA 
ol BCG (IPP). A) . The fragment of 259 base pairs 
comprising a part of the regX3 gene was used as a pmbe 
to detect the gene in different restriction fragments 
in the chromosomal DMA. of the BCG {If?} . The 
restriction enzymes used are indicated at the top of 
the figure. 

B) . Partial restriction map of the locus of the senX3 
and regX3 genes of the DCC (IPP). The restriction sites 
are indicated with respect to the senX3 and zeqX3 
genes, as well as the probe used. 

Figure 2: Nucleotide sequence of the senX3 and 
rcgX3 genes of the BCG (IPP) and derived protein 
sequences. The arrows indicate palindromic sequences . 
The putative Shi n-Dalgarno sequence in underlined in 
dots (SD) . The predicted transmembrane sequence of 
SejiXS is doubly underlined. The regions of 5enX3 
consexv/ed in other sensors are underlined s.nd 
annotated: H, region containing modified histidine; N, 
reqion rich in aaparagine; F, region rich in 
phenylalanine; Gl and G2, regions rich in glycine. The 
small vertical arrows indicate the residues which are 
predicted to be phosphcrylated. The annotated sequence 
PgmY indicates the end of the gene encoding 
phosphoglycerate niutase. 

Figure. 3: hydrophobicity profile of SenX3. The 
positive values indicate hydrophobic xeyior.s and the 
negative values indicate hydrophilic regions. The 
arrows indicate the possible initiation codons and the 
two horizontal lines indicate predicted transmembrane 
regions. The numerals at the top of the figure indicate 
the numbers of amino acids. 

Figure 4: comparison of the interr.istronic 
region between the BCG {IPPJ and M. tuberculosis (IPT.) . 
The senX3 and regXS genes are indicated by the arrows. 
The nucleotide sequences are indicated as well as the 
inferred protein sequences. 
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Figure 5: Southern blot analysis of DNA of 
M. tuberculosis and of BCGL The chromosomal DNA Of 
M. tuberculosis (IPL) (on Che right) and of BCG ' (IPP) 
(on the left}, was digested by Pstl, subjected to 
electrophoresis in agarose and analyzed by 
hybridization with the senX3-regXi intergenic probe of 
H. tuberculosis (IPL) . 

figure G: analysis by electrophoresis in 
agarose (2.5%) of the products obtained by FCR carried 
out on different mycobacterial strains. Lanes L to 3 ; 
group 1, respectively M. microti, M. tuberculosis V8Q8, 
M. r.nberculosis V761; lanes 4 and 5, group 2, 
respectively M. tuberculosis V729, M. bovis 60; lanes b 
to 8, group 3, respectively M. bovis S3, M. bovis 78 
and M. bovis AN5; lanes 9 and 10, group 4, respectively 
M. tuberculosis R37Ra (IPL) and M. tuberculosis H37r v 
(IPP); lanes 11 and 12, group 5, respectively 
M. tuberculosis (IPL), M. bovis 76/ laned 13 and 14, 
group 0, respectively M. bovis (SCGite 29) and M. bovis 
BCG ;i?P), lane IS, group 7, M. tuberculosis No. 19. 

Figure 7: Southern blot analysis of the 
rragments obtained in Figure 6. The probe used is the 
senX'J-regX3 inteicistronic region of M. tuberculosis 
(IPL) . 



EXAMPLES 



EXAMPLE 1: Genomic .nap and cloning of the 
senX3-regX3 genes of M. bovis BCG (IPP) , coding for the 
two-component mycobacterial system. 

Wren at al. (199?) amplified a tragment of 259 
base pairs from the gene of M. tuberculosis { IPL) which 
rhey called rcgX3. 

The authors of the present invention amplified 
the corresponding sequence from M. bovis BCG (vaccine 
=train 1173F2, obtained at the WHO collection of 
SLuckholm, Sweden) using the following synthetic 
oligonucleotides: 5 ' -CGAGGAGTCCCTCGCCGATCCGC-3 ' and 5'- 



VTO 98/07847 PCT/FR97/01483 
- 16 - 

AGCGCCCCAGCTCCACCCGACC-3 * . The amplif icaLion was 
carried out by using these" primers and a. Deep Vent 
polymerase (New England Biolabs) . The amplification 
product of 253 base pairo resulting was then purified 
by electrophoresis on agarose gsl and then sub-cloned 
in the Smal site of the pBluescript KS+ vector 
(Str=ttagene) according to standard protocol.? (Sambrook 
et a.1., 1989). This inseit was then removed from the 
recombinant plasmid by digestion with BamHI and £coRI 
and then labelled with dL'T?("a- v P] by "random priming" 
(random labelling) using the "raiidoia priming" kit 
marketed by Boehringer according to die conditions 
iecommended by the manufacturer. The genomic DNA of 
M. buvis DCG (IPP), cultured in Sauton medium (Sauton, 
1912) at 37 °c in flasks for stationary culture of 
tissues, was extracted as described previously (Kremer 
et ai., 1995a) and digested by differer. 1 - restriction 
enzymes. This DNA was then subjected to electrophoresis 
nn agarose qel and to Southern Dlot analysis according 
to the standard methods (Sanbirook er al . , 1989). The 
testing of the blot with the -labelled DNA showed 

that digestion by Kpnl with either SainKI, EcoD.1 or Pstl 
produced pairs of hybridizing bands, with one band 
labelled more intensely than the other in each of the 
pair- (Fig. la). These bands were attributed to the 5' 
end 3' sequences respectively, of the unique asymmetric 
Kpnl site of the probe. This allowed the BanKl, tcoRI 
and Pstl 5' and 3' sites of the Kpnl site to be 
localized (Fig. lb). The digestions with only Pstl gave 
a single hybridization band. As could be predicted, the 
digestion with Kpnl gave two bands and the digestion 
with Bami and EcoRI lad to the obtainment of a single 
band oi approximately 3.5 kb. 

The genomic DNA of the bcg was digested with 
BamHl and EcoRI, and DNA fragments of 3 to 4 kb were 
isolated after electrophoresis on agarose yel. These 
fragments were inserted into pBl ue5 cxipt sk 
(Stratagene) restricted by Bami and EcoRI . 900 
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recombinant clones were screened with the ["PJ -labelled 
probe by the said standard * technique of "colony blot 
hybridisation" (Sambrook et al . , 1989). Three of them 
were positive and turn out to contain the same 
EamHI/CcoRI DNA insert 01 3.2 Kb by restriction 
analysis. A hybridizing clone was isolated fcr the 
latter studies and was named pRegX3Bci (1-1765) . 

EXAMPLE 2: Sequence of the genes of M. bovis 
BCG (1PP) ceding for the two-component system 

ihe Sail fragments cf 1.0, 1.5 and 0.'/ Jcb were 
isolated from the 3.2 kb insert of pRegX3Bci (1-1/65;, 
and sub-clcned in pBluescript SK-. The sequence of the 
3.2 kb insert, as well ds that of the sub-clones Sail 
was determined on the two strands by the method cf 
"primer walking". The nucleotide sequence of 3208 bp of 
M. bovis BCG (IPP) (Fig. 2) showed the frequency of C + 
G characteristic of the mycobacteria (66. CI). Two 
principal open reading frames (ORF) designated by regX3 
and senX2 were identified (Fig. Ib and 2) . The j.egX3 
ytne starts with an ATG triplet at the 16~>9 position 
and ends with a TAG triplet at the 2360 position. It 
contains the sequence of the JJ P-lab' :i l lad probe and 
codes for a protein whose inferred sequence of amino 
acids is 227 residues for a calculated molecular mass 
Of 24,881 Da. The limit upstream of the se/iX3 ORF is 
not certain because five in-phdse potential initiation 
codons (ATG or GIG) were found - in a short distance 
(from 296 to 446] (Fig. 2 and 3). However, only the CTC 
at position 296 is preceded with 9 nucleotides 
upstream, by g sequence homologous to the shine- 
Dalgarno sequence of Fscherichia coli (4 of the 6 
residues arc identical to AGGAGG) . Thus, zhis GTG codon 
could be the limit in 5 ' of the sphX3 ORF which ends 
with the TGA stop codon at the position 7 525. This ORF 
is thus presumed to code for a protein, of 44,769 Da 
composed of 410 amino acid residues. The senX3 gene is 
preceded at 273 nucleotides upstrgam by the 3' part of 
an ORF encoding a homologuc of the phosphoglycerate 
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mutases. A sequence capable of forming a hairpin 
structure is localized between the positions 17 8 and 
194. This sequence could function as a terminatory 
transcription sequence Ot the ORF upstream. 

A search on the MycDB database shoved that the 
sei$L3 and regX3 genes are also present in M. leprae. 
The pxoduct3 which they encode are respectively similar 
to 62.7=5 and 94.9?; to the proteins senX3 and RagK2 of 
H. bovls (IPP). The esnX3 05F of M. leprae turns out to 
be initiated by a GTC codor. and is preceded by an ORF 
coding ror a humologue of phosphogl ycerate mutase, 
similar to that which was f ound in M. voxri « BCG (IPPJ . 

EXAMPLE 3: Cloning and sequencing of the senXJ- 
regX3 gf=nes of W. CL?bercL>iu;>is {I PL) . 

The senX3 and regX3 genes of M. tuberculosi s 
(IPL; were cloned by : J CR frou chromosomal DMA of M. 
tuberculosis P2962067, z clinical isolate from the 
collection of mycobacteria of the InsLitat Pa3teur de 
Lille. The DNA of M. tuberculosis { IPL? w d s extracted 
by the method described above for Che extracl-on of 
chromosomal DNA of the BCG (Kremer et ai., 1935a). The 
fragment of 2.2 kb containing the sanX3 and regX3 genes 
of M. tuberculosis (IPL) was amplify <=-d by PCR using the 
following px liners ; homologous to the adjacent sequences 
Of the senX3 and tecfA2 genea of hi. bovis BCG fTPP); 5'- 
TGGCGTAGTGTGTGACTTGTC-3 ' and S' GACC AGACAGTCGCCAAGGT T - 
3'. The amplified fragment was cloned in the Smal site 
of pRluescript SK- (version II) to produce a plasmid 
named pRegX3Ktl (1-1766). The total figment of 2,2 kb 
was then sequenced using the same stxaiegy &5 that 
described in F.xample 2 for the senX3 and reyX3 genes of 
the BCG (IPP). The DNA sequence of che senX3 and x*gX3 
ORF of M. tuberculosis (IPL) as well as the 5' region 
upstream of sen.v.3 and the 3' region downstream of regX3 
were identical to thos« of the *r.G (TPP). Howevgr. the 
intercistronic region b e tw= en SA „X3 and re^X3 
demonstrated interesting differences which nrft studied 
in the example below. 
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EXAMPLE 4: Analysis of the senX3-regX3 
intcrcistronic region 

The intarcistronic region between senX3 and 
reg-X.3 contains a perfect duplication of 11 base pairs 
) in tandem in the BCG (IPF) (Fiy. 4). Each repeated 
sequence contains a short ORF which has the capacity to 
code, for a peptide of 25 amino acids. The ATG 
initiation codon which can he inferred from Lhe first 
repetition overlaps the TGA stop codon of senX3. This 
repeated sequence end3 with two stop codons in-phase, 
which overlap Lhe ATG initiation codon inferred from 
the following repetition. The ORF of the second 
^petition also ends with a double TGA shop cocon, 
which overlaps the ATG start codon of regX3. 

The intercistronic region of M. uuberculofiis 
(IPL) is longer and contains a third repeated sequence 
which, however, is not complete. It contains a short 
in-phase interna] deletion Of the nucleotides 4 0 to 6S 
which arc replaced hy GAG. The ORF ol this third 
repetition also ends with a double stop codon 
overlapping the ATG codon of regX3 . 

The sequence of the intercistronic region of m. 
leprae is shorter than that of BCG (IPP) . it contains 
52 base pairs. 

The exlstance of these structures in repetition 
in the SenX3-regX3 intercistronic region led the 
inventors to find out whether they were present in 
other regions of M. tubeiculosis (IPL) or in the 
chromosome of M. be vis BCG (IP?) by Southern Plot 
analysis. The serX3-regX3 intercistronic region of M. 
tuberculosis (TPT.J was obtained by enzymatic digestion 
of P RegX3Mt2 with Kr.oKl and BamHI, The resulting DNA 
fragment of 491 bass pairs was then digested with Bsrl- 
Alul to produce a DNA fragment of 218 base pairs which 
corresponds to the . «M*3-«flflC3 intergen<c fragment of 
M. tutorculosis <i Pt) . This DNA figment was then 
labelled at random with digoxygenin-dUTP (kit cat 



No. 10D3657) according zo the recommendatio: 
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manufacturer (Boehringer Mannheim) . pRegX3Mt2 was 
produced bv a first amplification by PCR of a fragment 
of chromosomal DNA of 471 base pairs of M. tUbeiCUlOSlS 
22962067 (IPL) using the following oligonucleotides: 
5' AAACACGTCGGGGCTAftTCA 3' and 5' CCTCAAAGCCCCTCC-TTGCGC 
3' and the resulting amplified fragment was than cloned 
in the Smal site of pBluescript KS- . 

The chromosomal DNA of M. luberculoaic (IPL) 
was then totally digested with PstI and subjected to 
electrophoresis on agarose gel and analysed by Southern 
BloL according to the customary procedures (Sautbrook et 
al., 1989). The labelled probe was then used for 
hybridization as indicated below. 

The probe was at first denatured by bciling for 
5 minutes, and the netnbranc was incubatpd with the 
probe overnight at 6S°c / after a prshybridi zation for 
two hours at 68"C. The buffer used for the 
prehybridization and Che hybridisation was 5*SSC 
(Sambrook et al., 198y) , 0.1% N-laurylcarccsinc (w/v) ; 
0.023, SD5 (w/v); 1% blocking reayent (Boehringcr 
Mannheim) . The membrane was then washed twice for five 
minutes in 2xSSC (Sanbrnok et al., 1989) and 0.1* SD3 
at ambient temperature and twice ror 15 minutes in 
O.ixGGC, 0.1% SDS at 68°C. The hybridized probes weie 
then immunologically detected with antidigoxyqenin Fab 
fragments conjugated to alkaline phosphatase and a 
cheiriilumiiiescent CSPD substrate (Boehringer Mannheim) . 

AS can be seer, in Figure 5, different copies of 
the senX3-regx3 intercistronic region arc presant in m. 
tuberculosis (IPL), as well as in M. bovis BCC (IPP). 

EXAMPLE 5: Amplification by PCR of the senX3- 
-r<=gX3 intercistronic region of different strains of 
mycobacteria. 

Given that the intercistronic region separating 
the s< ?nX3 and regX3 genes has variations in length 
between M. bovis BCG (IPP), M. tuberculosis (IPL) and 
M. leprae, the authors of the present invention 
analysed the corresponding region in other strains of 
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K. bote's (including uon-BCG strains) and of W. 
tubarculo.si They likewise ar.alyaed other species of 
mycobacteria: (i) the Othet members of the M. 
tuberculosis complex: M. africanam and M. microti and 
(ii) the mycobacteria other than those of the M. 
tuberculosis complex. 

The strains analysed are indicaLed in Tables 1 
and 2. Their chromosomal DMA was prepared as indicated 
below. The mycobacteria were recovered by 
cenirif ligation (300 revs/min; 30 minutes!, starting 
trom 100 ml of culture and incubation for I hour at 
37°C m 10 ial of phosphate buffer (0.4 M sucrose, iO mM 
EDTA, Tris-HCl (pH 8) 10 mM, 4 mg/ml of lysozyme). The 
protoplasts obtained were recovered by centri fugation 
(30n0 revs/min; 20 minutes) and then lysed by placing 
them to incubate ror 1 hour at 60 °C in 6 ml of L buffer 
!10 mK NaCl, SDS 6%, 10 mM Tris-HCl (pH 0), 500 ug/ml 
of proteinase K) . After addition of 1.5 ml of 5 M NaCl, 
the mixture was centrifuged (14,000 re^/mm., 20 
minuter) . The supernatant was subjected to extraction 
with phenol-chloroform and the DNA was precipi tated 
with isopropanol. The resnspended ' pellet was then 
Lzedted with RNAcc, extracted with phenoi-cnioro forir. 
and chloroform and then precipitated with ethanol. The 
pellet was then dried in air and resuspendAd in 100 ul 
of TE buffer (10 mM Tris-IICl, 1 mM EDTA, pH 7.6). 

The primeta for PCR analysis were chosen from 
the DNA sequence of Lhe senX3 and regX3 genes of M. 
bovis BCG (iff) and M. tuberculosis (I?L) . A primer 
ICS) hybridized to the 3' end of Lhe senX3 gene and had 
the following 5'-3' sequence: 5 ' GCGCGAGAGCCCGAACTGC3 ' . 
The other primer (C3) hybridized to the 5' end of the 
regX3 gene end ban the following S'-3' sequence: 5'- 
gcgcagcagaaacgtcAGC-3' . With these two priir.ers, the pgr 
product comprises 56 additional base pairs at the 5' 
end cf the intergenic rsgion and 52 additional bass 
pairs at its 3' end with respect to the intercistronic 
region. The PCR product has a length of 369 base pairs 
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lor M. tuberculosis 22962067 (lfL) and 276 base pairs 
for W. bo vis BCG {IPPJ - 

The TCPv amplifications were carried out in a 
Hhsrnocycler (Perkin Elmer) by intubation Ol 1 ul of 
chromosomal DNA with the following reaction mixture: C5 
and C3 oligonucleotides (1 ug/"i jil of each j , dTNP 
(2 fil) 25 itiM/ TMACl (tctramethyl ammonium chloride) 
(2 jil) 5mM, 10 x enzymatic buffer {10 jil) , Vent UNA 
polymerase (l U/0.5 ul) , watar (R? ul) . The 
amplification was carried out over 30 cycles at 94°C 
for one minute, 65°C for one minute and 72 °C for one 
minute. ID jil of PCR product were then subjected no 
electrophoresis on 2.5% agarose gel and visualized with 
ethidiuro bromide. The negative controls containing all 
the PCR reagents except the DNa matrix were tieated in 
parallel with the samples. The length of the PCR 
product was estimated by comparison with the DNA scale 

of 1 :<b. 

No FCR product was detected for the 11 strains 
of ton- tuberculosis mycobacteria tested (see Table 2) . 
Three strains of Streptomycea (S, czcaoi, S. Rfil and S. 
R3S), as well as E. coli TGI likewise gave negative 
results in FCK. On the other hand, PCR products were 
nhtained with all the member s of the M, tuberculosis 
complex. The specificity of the PCR products waa 
confirmed in all the cases by Southern Blot analysis 
using as probe the senX3-regx3 intergenic reyion of M. 
tuberculosis (IPT.) labelled with digoxygenin, di 
described in Example <3 . 

The length of the amplified fragments is 
indicated in Table 3. Eight different groups with 
different amplicon lengths were obtained. Out of the 35 
clinical isolates of M. tuberculosis tested, 3a gave 
PCR fragments of 320 base pairs which could not be 
distinguished from . that obtained with rhe reference 
slidin M. tuberculosis 229S2067 (IFL) (group 5). One 
strain of M. tuberculosis (No. 13) ga v e a rC R produce 
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of 254 base pairs (group 8) . W. tuberculosis S200 also 
belongs to group 5. 

Strains of M. rnherculosis from Vietnam and not 
containing the IS6L10 sequence (V. 80S, V. 761 and 
V. 123) were different from the maior M. tuberculosis 
group (group 5) . The length of their PCR product 
exceeded 500 basi pairs (one product or +/- 500 base 
pairs and two of 560) . The laboratory strains of M. 
tuberculosis H37Ra and il37Rv had PCR products which 
wer<= slightly larger than those obtained in grcup b of 
the strains Ot M. tuberculosis and these two strains 
are classified among group 4. These two strains are odd 
in as far s.s the uncertain pathogenicity for H37Rv and 
the loss of pathogenicity of H37Ra ate concerned. In 
addition, no clinical case of M. tuberculosis was 
present in groups A, 6, 8: these three groups are thus 
specific for the non-virulent strains. 

The 5CC strains (vaccine strains and isolates 
of clinical cases of BCGites) were divided into three 
groups (No. 4, 5 and 2) . The PCR products obtained from 
these strains had s length of 353 base pairs, 27 6 and 
19S> respectively. 

The most important variability at the level of 
the length of the PCR fragments was encountered for the 
non-BCG strains of M. bovis. Six strains were tested. 
Three of them, including the reference strain ANS, gave 
PCR products of 406 base pairs (group 3) . The three 
other strains were grouped in groups 2, 5 and 7 
corresponding to PCR fragments of approximately 500 and 
of 329 or 2 54 base pairs respectively. 
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Table 1: H. 



tvib&xculoais complex 



M. nuhercvlosis 



Species 



M. sfricanum 
M. microti 



3trains 



reference (22962057) 

- H37Ra 

- H37P.V 

- S200 

- 35 clinical isolates 
(different profiles of 
IS6110 RFLP) 

3 clinical isolates 
from Vietnam : V808, 
Y761, V7 29 



- BCG ( vacclue strains) ; 
BCC IPP 

BCG Moreau 
ECG Japonicus 
BCG Pragues 
BCG Montreal 
BCG Russe 
RCG Glaxo 

- case ul BCGitca : 

l clinical isolates 
(28, 29, 30, 3D 

- gon BCG : 
refsrsnce ANS 

1 clinical isolate (1) 

2 isolates from gnats 
(60. 631 

2 isolates from cows 
P6, 78) 

- clinical isolate 

- reference ATCC 19422 



I PL 
TP?" 

£pp (MdLChaD 
CHS. of Lille"" 
CHR of Lille 



IPP (Marchal) 



IPP 
IPL 
IPL 
IPP 

■ IPP 
IPP 

• IPP 



- IPL 

- CHR of Lille 

- lft (Marchal) 

- CHR of Lille 

- University ot 
Zarayo^a (Martin) 

- University of 
Zaragoza (Martin) 



■ CHR of Lille 
• IPP (M=rchal ) 



Ins ti tut Pasteur de Lille 
Inatitttt Pasteur de Paris 
Centre Hospitaller Regional de Lille 
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Table 2: Atypical mycobacteria 



Species end strains 


Sources 


M. curum 


- clinical 


isolates 


(IPL) 


M. avium 


- clinical 


isolates 


(IPL) 


M. ciielonnn 


- clinical 


isolates 


(IPL) 


M. flavescens 


- clinical 


isolates 


(I FT.) 


M. fGTtuivnm 


- clinical 


isolates 


(IPL) 


M. kanxzii 


- clinical 


isolates 


(CHR of Lille) 




- clinics! 


isolates 


(IPL) 


M. scrozulaceum 


- clinical 


isolates 


{ T PL) 


M. smegmatis 


- IPL 






M- terse 


- clinical 


isolates 


(IPL) 


W. xenopi 


- clinical 


isolates 


(IPL) 



ry 



to o 
•i Cr> 

CD — 



8 



to w 
CJ u 



« c c - 

H _ 

_j u 

f-: -3 T3 



1 xj 
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The amplified DNA fragmants comprise the senX3- 
r<=gX3 intercistronic region as well a? 56 base pairs 
upstream of this and 62 base pairs downstream. The 
arrows 77 and b'3 designate the repeated elements found 
in each group. The strains Cur which the intercistrnnic 
region was sequenced are underlined. 
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SEQUENCE LISTING 



PCT/FR97/01483 



(1} GENERAL INFORMATION: 

(i) APPLICANT: 

( A) NAME: Institut Pasteur de Lille 

(Bl GTR2ET: 1 rue dv Professeur Calmette 

(C) CITY: Lille 

(E) COUNTRY: France 

(F) POSTAL CODE: S9019 

(G) TELEPHONE: 032C8779O0 

(H) TELEFAX: 0320077906 

(A) NAME: Instil Jt National dc la Sante e 
de ia recherche juedicale 

(B) STREET: 101 rue ce Tolbiac 

(C) CTTY: Paris 

(D) COUNTRY: France 

(F) POSTAL CODE : 75013 

(G) TELEPHONE: 0144236039 
(II) TELEFAX: 014 58S07fin 

(ii) TITLE OF THE INVENTION: frogmen's of 
specific nucleic acids of M. tuberculosis 

(III) NUMBER OF SEQUENCES: 2 

(IV) COMPUTES. READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(E) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC- DOS /MS-DOS 
(n) SOFTWARE : Patentln Release Jtl.o, 
Version 81.2b (EPO) 

(2) I NFORMAT I ON FOP. .SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 77 base pairs 
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(D) TYPE : nucleic zc.id 

(C) STRANDEDNESS : single 

(D) CONFIGURATION : linear 



(ii) TYPE OF MOLECULE: DNA 'genomic) 

(Vi) ORIGINAL SOURCE : 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: 5EQ ID NO: 1: 

ATGACCTGCG CCGACGACGA TGCAGAGCGT AGCGATGAGG TGCGGGCACC 
ACCCCCTTGC GGGGGAGAGT GGCGCTG 



(2] INFORMATION FOR SSQ TD NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) CONFIGURATION: linear 



(ii) TYPE OF MOLECULE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(:<i) SEQUENCE INSCRIPTION: SEQ ID NO: 2: 

ATGACCTGCG CCGACGACGA TGCAGAGCGT AGCGATGAGG AGGAGTGGCC- 
CTG 



